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Abstract 

This article presents simple and easy proofs of the Implicit Function 
Theorem and the Inverse Function Theorem, in this order, both of them 
on a finite-dimensional Euclidean space, that employ only the Interme- 
diate Value Theorem and the Mean- Value Theorem. These proofs avoid 
compactness arguments, the contraction principle, and fixed-point theo- 
rems. 
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1 Introduction. 

The objective of this paper is to present very simple and easy proofs of the 
Implicit and Inverse Function theorems, in this order, on a finite-dimensional 
Euclidean space. Following Dini's approach, these proofs do not employ com- 
pactness arguments, the contraction principle or any fixed-point theorem. In- 
stead of such tools, the proofs presented in this paper rely on the Intermediate 
Value Theorem and the Mean- Value Theorem on the real line. 

The history of the Implicit Function Theorem is quite long and dates back 
to R. Descartes (on algebraic geometry), I. Newton, G. Leibniz, J. Bernoulli, 
and L. Euler (and their works on infinitesimal analysis), J. L. Lagrange (on real 
power series), A. L. Cauchy (on complex power series), to whom is attributed 
the first rigorous proof of such result, and U. Dini (on functions of real variables 
and differential geometry) . For extensive accounts on the history of the Implicit 
Function Theorem (and the Inverse Function Theorem) and further develop- 
ments (as in differentiable manifolds, Riemannian geometry, partial differential 
equations, numerical analysis, etc.) we refer the reader to Krantz and Parks 
[T3], Dontchev and Rockafellar [7, pp. 7-8, 57-59], Hurwicz and Richter [TTj . 
Iurato [H], and Scarpello [15] . 

Complex variables versions of the theorems studied in this paper can be seen 
in Burckel [4, pp. 173-174, 180-183] and in Krantz and Parks [14, pp. 27-32, 
117-121]. See also de Oliveira [5, p. 17]. 



1 



The two most usual approaches to the Implicit and Inverse Function theo- 
rems on a hnitc-dimcnsional Euclidean space begin with a proof of the Inverse 
Function Theorem and then derive from it the Implicit Function Theorem. The 
most basic of these approaches employs the compactness of the bounded and 
closed balls inside R™, where n > 1, and Weierstrass' Theorem on Minima. See 
Apostol [1, pp. 139-148], Bartle [2, pp. 380-386], Fitzpatrick [8, pp. 433-453], 
Knapp [13, pp. 152-161], Rudin [17, pp. 221-228], and Spivak [19, pp. 34- 
42] . The more advanced approach is also convenient on complete normed vector 
spaces (also called Banach spaces) of infinite dimension, employs the contraction 
mapping principle (also called the fixed-point theorem of Banach), and besides 
others applications is very useful in studying some types of partial differential 
equations. See Dontchev and Rockafellar [7, pp. 10-20], Gilbarg and Trudinger 
[9, pp. 446-448], Renardy and Rogers [16, pp. 336-339], and Taylor [20, pp. 
11-13]. 

In this article, we prove at first the Implicit Function Theorem, by induction, 
and then we derive from it the Inverse Function Theorem. This approach is 
accredited to U. Dini (1876), who was the first to present a proof (by induction) 
of the Implicit Function Theorem for a system with several equations and several 
real variables, and then stated and also proved the Inverse Function Theorem. 
See Dini [6, pp. 197-241]. 

Another proof by induction of the Implicit Function Theorem, that also 
simplifies Dini's argument, can be seen in the book by Krantz and Parks [14, 
pp. 36-41]. Nevertheless, this particular proof by Krantz and Parks does not 
establish the local uniqueness of the implicit solution of the given equation. 
On the other hand, the proof that we present in this paper further simplifies 
Dini's argument and makes the whole proof of the Implicit Function Theorem 
very simple, easy, and with a very small amount of computations. The Inverse 
Function Theorem then follows immediately 

2 Notations and Preliminaries. 

Let us indicate by R the complete field of the real numbers, and by R n , 
where n > 1, the n-dimensional Euclidean space. If a and b are real numbers, 
with a < b, we set ]a, b[= {x in R : a < x < b} and [a, b] = {x in R : a < x < b}. 
We assume without proof the following two results. 

• The Intermediate Value Theorem: Let / : [a, b] — >• R be continuous. If A is a 
value between /(a) and /(&), then there is a c in [a, b] satisfying /(c) = A. 

• The Mean- Value Theorem: Let / : [a, b] — > R be continuous on [a, b] and 
differentiable on the open interval ]a, b[. Then, there exists c in ]a,b[ 
satisfying /(&) - f(a) = f(c)(b- a). 

The first complete proof of the Intermediate Value Theorem was given by B. 
Bolzano (1817), who was also the first to enunciate the supremum property 
of the real numbers, see Liitzen [15, p. 175] and Hairer and Wanner [10, p. 
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206]. The Mean- Value Theorem is attributed to Lagrange (1797), see Hairer 
and Wanner [10, p. 240]. 

In what follows we fix the ordered canonical bases {ei, . . . , e„} and {fi, ■ ■ ■ , f m }, 
of R™ and R™ 1 , respectively. Given x = X\e\ + ■ ■ ■ + x n e n in R™, where 
real numbers, we write x = (x\, . . . , x n ). If x = (x\, . . . , x n ) 
and y = (j/i, . . . , y„) are in R™, then their inner product is defined by (x, y) = 
Xiyi + ■ ■ ■ + x n y n . The norm of x is \x\ = \/ x\ + ■ ■ ■ + x\ = (x, x). It 
is well-known the Cauchy-Schwarz inequality | (x, y) \ < \x\\y\, for all x and 
all y, both in 1™. Moreover, given a point x in R" and r > 0, the set 
B(x; r) — {y in R™ : \y — x\ < r} is the open ball centered at x with radius r. 

Given a linear function T : R™ — > R m , there are real numbers a^ , where 
i = 1, . . . , m and j = 1, . . . , n, such that T(ej) = ay/i + • • • + a m jf m , for each 
j = 1, . . . , n. We associate to T the m x n real matrix 



/an ... ai„ \ 



\ fl ml • • • ^mn / 



— ( a ij)l<i<m- 
l<j<n 



The jth column of the matrix (a^) supplies the m coordinates of T(ej). Fur- 
thermore, given a m x n real matrix M — (ay) we associate with it the linear 
function T : R™ ->• R m defined by T(ej) = (ay, . . . , a mj -), for all j = 1, . . . , m. 
If v belongs to R™, for simplicity of notation we also write Tv for T(v). The 



Hilbert-Schmidt norm of T is the number j|T|| = ^J^2T=i l a 



«j I 2 - 



Lemma 1 Let T : R™ -)• R m 6e a linear function. We have \Tv\ < \\T\\\v\, for 
all v in R n , and T is continuous. 

Proof. Let (ay) be the matrix associated to T. Given v in R", we have \Tv\ 2 = 
Y^iLi I (( a n> ■ ■ ■ >Q-in),v) | 2 . By employing the Cauchy-Schwarz inequality we 
obtain \Tv\* < J2T=i \(an, . . . , a in )\ 2 \v\ 2 = \v\ 2 £™ 1 E?=i \<Hj ? = \\T\\ 2 \v\ 2 . 
Thus, \Tv\ < \\T\\\v\. Finally, from the inequality \T(v + h) - T(v)\ = \Th\ < 
\\T\\\h\, for all v and all h, both in R", it follows that T is continuous on R"B 

Let ft be an open set in R™. Given a function F : il — > R m and p in 0, 
we write F{p) — (i*\(p), . . . , F m (p)), with Fi : £1 — s> R the ith component of F, 
for each i = 1, . . . , m. We say that F is diffcrcntiable at p if there is a linear 
function T : R" ->• fl m and a function £ : B(0; r) R m defined on some 
ball £>(0;r), with r > 0, centered at the origin of R™ such that F(p + h) = 
F(p) + T(h) + E(h), for all h in B(0;r), where E(h)/\h\ as h ->• 0. The 
function F is differentiable if it is differentiable at every point in fl. 

The linear map T is the differential of F at p, denoted by DF(p). Suppos- 
ing that t is a real variable and defining for each j — 1, . . . ,n the directional 
derivative of F at p in the direction ej as ^ (p) = lim F ( p+te i)~ F (P) j we deduce 

that DF(p)( ei ) = g(p) = (f&(p), • . • , ^(p)). Putting f|(p) = ||(p), for 
alii = 1, . . . ,m and all j = 1, . . . ,n, we call §§-{p) = (f^Cp), • • • , ^(p)) the 
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jth partial derivative of F at p. The Jacobian matrix of F at p is 



( 




) 





JF(p) 



1 < i < rn 
l<j<n 




We say that F is of class C 1 if F and its partial derivatives of order 1 are 
continuous on fi. In such case, we also say that F is in C 1 (f2;R n ). 

The following lemma is a local result and valid on an open subset fl of R™. 
For practicality, we enunciate it for f2 = R n . 

Lemma 2 Let F : R" -> R m fee differentiable, T : R fe -4 R™ fee i/ie Zinear 
function associated to a n x k real matrix M, and y be a fixed point in R™. 
Then, the function G(x) = F(y + Tx), where x is in R fc , is differentiable and 
satisfies JG{x) = JF(y + Tx)M, for all x in R k . 

Proof. Let us fix a; in R fe . Given v in R™, by the differentiability of F we have 
F(y + Tx + v) = F(y + Tx) + DF(y + Tx)v + E(v), where E(v)/\v\ ^ as 
v — > 0. Substituting v = Th, where h is in R fe , into the last identity we obtain 
G(x + h) = G(x) + DF(y + Tx)Th + E(Th). Thus, supposing h ^ 0, we have 



= 0, if Th = 0, and ^ff- = ^gi^i, if Th / 0. Since Th -> as 



h ->■ and, by Lemma 1, \Th\/\h\ < \\T\\, we conclude that E(Th)/\h\ goes to 
as h goes to 0. Hence, G is differentiable at x and JG(x) — JF(y + Tx)MU 

With the hypothesis on Lemma [21 we see that if F is C 1 then G also does. 
Given a and fe, both in R™, we put ab = {a + t(b — a) : < t < 1}. 

Lemma 3 (The Mean- Value Theorem in Several Variables). Let us 

consider F : f2 — > R differentiable, with open in R™. Let a and b be points in 
Q such that the segment ab is within CI. Then, there exists c in ab satisfying 



Proof. The curve 7(i) = a + t(b — a) , with t in [0, 1], is inside fi. Employing the 
mean- value theorem in one real variable and Lemma[2]we obtain F(b) — F(a) = 
(Fo 7 )(l)-(Fo 7 )(0) = (Fo 7 )'(i ) = <VF( 7 (i )),fe-a>, for some t in [0,1]B 

Lemma 4 Let F be in C 1 (r2; R"), with CI open within R", and p in ft satisfying 
detJF(p) ^ 0. Then, F restricted to some ball B(p;r), with r > 0, is injective. 

Proof. (See Bliss [3]) Since F is of class C 1 and the determinant function 
dct : R" 2 ^ R is continuous and det JF(p) = det (ff^)) + 0, there is r > 
such that det (§f-K£y )) does not vanish, for all £y in B(p; r), where 1 < i, j < n. 

Let a and 6 be distinct in B(p;r). By employing the mean- value theorem 
in several variables to each component Fi of F, we find Ci in the segment ab, 



F(b)-F(a) = (VF(c),b-a). 
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within B(p; r), such that F^b) - Fi{a) = (VF,(c,), b - a). Hence, 



/ F 1 (6)-F 1 (a) 



) 




) 



/ h-ai \ 



V F n (b)-F n (a) 




\ b n -a n ) 



Since dct (f^(ci)) ^ and 6 - a ^ 0, we conclude that F(6) ^ F(a)B 

3 The Implicit and Inverse Function Theorems. 

The first implicit function result we prove regards one equation and several 
variables. We denote the variable in R™ +1 = 1™ x 1 by (a;, y), where x = 
(x\, . . . , x n ) is in R n and y is in R. 

Theorem 1 Let F : 51 — ^ R be of class C 1 in an open set £1 inside R n x R and 
(a, b) be a point in such that F(a, b) = and §^(a, 6) > 0. Then, there exist 
an open set X , inside R" and containing a, and an open set Y , inside R and 
containing b, satisfying the following. 

• For each x in X there is a unique y — f(x) in Y such that F(x, f(x)) = 0. 

• We have /(a) = b. Moreover, f : X — > Y is of class C 1 and 



Proof. Let us split the proof into three parts. 

o Existence and Uniqueness. Since ^(a,b) > 0, by continuity there exists a 
non-degenerate (n + l)-dimensional parallelepiped X' x [61,62], centered 
at (a, b) and contained in fi, whose edges are parallel to the coordinate 
axes such that ^ > on X' x [61, 62]- Then, the function F{a, y), where 
y runs over [61,62], is strictly increasing and F(a,b) = 0. Thus, we have 
F(a, 61) < and F(a,b 2 ) > 0. By the continuity of F, there exists an 
open non-degenerate n-dimcnsional parallelepiped X, centered at a and 
contained in X', whose edges are parallel to the coordinate axes such that 
for every x in X we have F(x,bi) < and F{x,b2) > 0. Hence, fixing 
an arbitrary x in X and employing the intermediate value theorem to the 
strictly increasing function F(x,y), where y runs over [61,62], yields the 
existence of a unique y — f(x) inside Y =]6i, b 2 { such that F(x, f(x)) = 0. 

o Continuity. Let 61 and 62 be such that 61 < 61 < 6 < 6 2 < 6 2 . From above, 
there exists an open set X" ', contained in X and containing a, such that 
f(x) is in ]6i,6 2 [, for all x in X" . Thus, / is continuous at x = a. Now, 




for all x in X, where j = 1, . . . , n. 
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given any a' in X, we put b' = f(a'). Then, / : X — > Y is a solution of 
the problem F(x, h(xj) = 0, for all x in X, with the condition h(a') = b' . 
Thus, from what we have just done it follows that / is continuous at a'. 

o Differentiability. Given x in X, let ej be jth canonical vector in R n and 
t ^ be small enough so that x + tej belongs to X. Putting P = (x, f(x)) 
and Q = (x+tej, f(x+tej)) , we notice that F vanishes at P and Q. Thus, 
by employing the mean- value theorem in several variables to F restricted 
to the segment PQ within Xx}bi,b 2 [, we find a point (x,y) inside PQ 
and satisfying 

= F(x + tej, f{x + tej)) - F(x, f(x)) = 

Since and ^ are continuous, with ^ not vanishing on Xx\b\,b2[, 
the function / is continuous and (x,y) — > (x, f{x)) as t — > 0. Thus, 

fix + te^-fjx) ggj gKfW) 

^ f(^y) f (*>/(*)) 

This gives the desired formula for and implies that / is of class C 1 ■ 

Next, we prove the general implicit function theorem. In general, we apply 
this theorem when we have a nonlinear system with m equations and n+m vari- 
ables. Analogously to a linear system, we interpret n variables as independent 
variables and determine the remaining m variables, called dependent variables, 
as a function of the n independent variables. 

Let us introduce some helpful notation. As before, we denote by x = 
(xi,...,x n ) a point in R™ and by y — (yi, . . . , y m ) a point in M. m . Given fl 
an open subset of R™ x R™ 1 and a differentiable function F : ft — > R m , we write 
F = (F\, . . . , F m ), with Fi the ith component of F and i = 1, . . . ,m. We put 



OF _ /dFV 
dy \dyj J i<i<m 



dFi _ _ _ dF ± 

dy m 



V 



dF m _ _ _ dF\r 

dyi dym 



l<j<m 

Analogously, we define the matrix ^ = ) , where 1 < i < m and 1 < k < n. 



Theorem 2 (The Implicit Function Theorem). Let F be in C^fijR" 1 ), 
with Q, an open set in R" x R m , and (a, b) a point in il such that F(a, b) = and 
^(a, b) is invertible. Then, there exist an open set X, inside R" and containing 
a, and an open set Y, inside R m and containing b, satisfying the following. 

• For each x in X, there is a unique y = f(x) in Y such that F(x, f(x)) = 0. 
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We have f(a) = b. Moreover, f : X — > Y is of class C 1 and 



Jf(x) 



dF 



mxm 



, for all x in X. 



Proof. We split the proof into four parts. 

o Finding Y. Denning $(x, y) — (x,F{x,y)), where (x, y) is in 0, we have 

dF 



det J$ = det 



I 









dx 


dy . 



det- , 
dy 



with / the identity matrix of order n and the n x to zero matrix. Thus, 
shrinking ft if needed, by Lemma 4 we may and do assume, without any 
loss of generality, that $ is injective and £1 = X' x Y, with X' an open 
set in M™ that contains a and Y an open set in M m that contains b. 

o Existence and differentiability. We claim that the equation F(x, f(x)) = 0, 
with the condition /(a) = b, has a solution f = f(x) of class C 1 on some 
open set containing a. Let us prove it by induction on m. 

The case m = 1 follows from Theorem 1. Let us assume that the claim 
is true for to — 1. Then, for the case m, following the notation F = 
(Fx, . . . , F m ) we write T = (F2, . . . , F m ). Furthermore, we put (x; y) = 
(xx,...x n ;yx, ■ ■ .,y m ), y' = (2/2, • • -,y m ), y = {yv,y'), and (x;y) = (x;yx;y'). 

Let us consider the invertible matrix J — ^(0,6) and the associated 
bijective linear function J : M. m R m . By Lemma 2, the function 
G(x; z) = F[x;b + J- X (z - &)], dehncd in some open subset of R" x R m 
that contains (a, b), satisfy ^(a; b) = J J -1 and the condition G(a; b) = 0. 
Hence, we may assume that J is the identity matrix of order to. 

Now, let us consider the equation Fx(x;yx;y') = with the condition 
2/1 (a; b') = bx- Since &i; 6') = 1, there exists a function j/i = ip(x; y') 

of class C 1 on some open set containing (a; 6') that satisfies 

Fx[x; ip(x; y'); y'\ — and the condition ip(a; b') = bx- 

Next, substituting yx = f(x; y') into T[x\ yx',y') = 0, we look at solving 

the equation T[x; <p(x, y'); y'] — 0, with the condition y'(a) = b' . 

Differentiating T[x; ip(x; y'); y'], with respect to j/2, ■ • ■ , y m , w e find 

dFi dip dFi dFi 

l^( a]b ^dy~( a ' b ^ + &y : ^ a ' b ^ = ° + ~dy 1 ( a;b ^ where2 ^*'^ TO - 

The matrix ^|p-(a; b)^j, where 2 < i,j < to, is the identity one of order 
to — 1. Hence, by induction hypothesis there exists a function ip of class 
C 1 on an open set X containing a that satisfies 

J r [x;<p(x;i/'(a;)),'0( a; )] = 0, for all x in X, and the condition if>(a) = b' . 
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We also have Fi [x; ip(x; tp(x)) ; tp(x)] = 0, for all x in X . Defining f(x) = 
((fix; ip(x)); ip(x)) , with x in X, we obtain F[x; f(x)] = 0, for all x in X, 
and /(a) = (f(a; b'); b') = (bi;b r ) — b, where / is of class C 1 on X. 



o Differentiation formula. Differentiating F[x,f(x)} = we find 




Thus, we have $(x,g(x)) = [x,F(x 1 g{x)) — (x,0) = $(x, /(x)), for all x 
in X. The injectivity of <f> implies that g(x) = f(x), for all x in XU 

Theorem 3 (The Inverse Function Theorem). Let F : fl — > R™, where 
£1 is an open set in K™, be of class C 1 and p a point in £1 such that JF(p) 
is invertible. Then, there exist an open set X containing p, an open set Y 
containing F(p), and a function G :Y — ^ X of class C 1 that satisfies F(G(y)) = 
y, for all y in Y, and G(F(x)) = x, for all x in X . Moreover, 



Proof. We split the proof into two parts: existence and differentiation formula. 

o Existence. Shrinking 0, if necessary, by Lemma 4 we may assume that F 
is injective. The function $(x, y) = F(x) — y, where (x, y) is in ft x R™, is 
of class C 1 and satisfies ^(F{p),p) = JF(p). From the implicit function 
theorem it follows that there exists an open set Y containing F(p) and a 
function G : Y -> Vt of class C 1 such that <f>(G(y),y) = F(G(y)) -y = 0, 
for all y in Y. That is, we have F(G(y)) = y, for all y in Y. 

Hence, the set Y is contained in the image of F. Since F is continuous 
and injective, the pre-image set X = F _1 (F) is open, contains p, and F 
maps X bijectively onto Y . 

The identity F(G(y)) = y, for all y in Y, implies that G maps Y to X. 
Since F is bijective from X to Y, the function G is bijective from Y to X. 



o Differentiation formula. Let us write F(x) — (F\ (x), . . . , F n (x)) and G{y) = 
(Gi(y),...,G n (y)). Differentiating (Gi(F(x)), . . . , G n (F(x))) wc obtain 



Acknowledgments. I thank Professors Robert B. Burckel, Paulo A. Martin, 
and A. Lymberopoulos for their very valuable comments and suggestions. 



JG(y) = JF(G(y)) \ for ally in Y. 
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